Bauer–Fike theorem

In mathematics, the Bauer–Fike theorem is a standard result in the perturbation theory of the eigenvalues of a complex-valued diagonalizable matrix. In substance, it gives an absolute upper bound on the distance between an eigenvalue of the perturbed matrix and a suitably chosen eigenvalue of the exact matrix. Informally, it says that the sensitivity of the eigenvalues is bounded by the condition number of the matrix of eigenvectors.

1 Theorem (Friedrich L. Bauer, C. T. Fike – 1960)
- 1.1 Proof
2 Theorem (Friedrich L. Bauer, C. T. Fike – 1960) (alternative statement)
- 2.1 Proof
3 Corollary
- 3.1 Proof
4 Remark
5 References

Theorem (Friedrich L. Bauer, C. T. Fike – 1960)

Let $A\in\mathbb{C}^{n,n}$ be a diagonalizable matrix, and let $V\in\mathbb{C}^{n,n}$ be a nonsingular eigenvector matrix such that $A=V\Lambda V^{-1}$, where $\Lambda$ is diagonal. Moreover, let $\mu$ be an eigenvalue of the matrix $A+\delta A$; then there exists an eigenvalue $\lambda\in\sigma(A)$ such that:

$|\lambda-\mu|\leq\kappa_p (V)\|\delta A\|_p$

where $\kappa_p(V)=\|V\|_p\|V^{-1}\|_p$ is the usual condition number in p-norm.
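The bound can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper name `bauer_fike_check` and the test matrices are illustrative choices, not part of the theorem. It compares the largest distance from a perturbed eigenvalue to $\sigma(A)$ with $\kappa_p(V)\|\delta A\|_p$.

```python
import numpy as np

def bauer_fike_check(A, dA, p=2):
    """Return (worst distance from an eigenvalue of A + dA to sigma(A),
    Bauer-Fike bound kappa_p(V) * ||dA||_p). Assumes A is diagonalizable."""
    lam, V = np.linalg.eig(A)               # A = V diag(lam) V^{-1}
    mu = np.linalg.eigvals(A + dA)          # eigenvalues of the perturbed matrix
    kappa = np.linalg.cond(V, p)            # kappa_p(V) = ||V||_p ||V^{-1}||_p
    bound = kappa * np.linalg.norm(dA, p)
    # For each mu_j, the distance to the nearest eigenvalue of A.
    dist = np.abs(mu[:, None] - lam[None, :]).min(axis=1)
    return dist.max(), bound

rng = np.random.default_rng(0)
A = np.array([[1.0, 100.0], [0.0, 2.0]])    # non-normal example, eigenvalues 1 and 2
dA = 1e-3 * rng.standard_normal((2, 2))     # small illustrative perturbation
worst, bound = bauer_fike_check(A, dA)
print(worst, bound)
```

For a strongly non-normal matrix such as this one, $\kappa_2(V)$ is large, so the bound can be far from sharp; the theorem only guarantees that the first number never exceeds the second.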

Proof

If $\mu\in\sigma(A)$ , we can choose $\lambda=\mu$ and the thesis is trivially verified (since $\kappa_p(V)\geq 1$ ).

So suppose $\mu\notin\sigma(A)$. Then $\det(\Lambda-\mu I)\ne 0$. Since $\mu$ is an eigenvalue of $A+\delta A$, we have $\det(A+\delta A-\mu I)=0$ and so

$0=\det(V^{-1})\det(A+\delta A-\mu I)\det(V)=\det(\Lambda+V^{-1}\delta AV-\mu I)$

$=\det(\Lambda-\mu I)\det[(\Lambda-\mu I)^{-1}V^{-1}\delta AV+I]$

and, since $\det(\Lambda-\mu I)\ne 0$ as stated above, we must have

$\det[(\Lambda-\mu I)^{-1}V^{-1}\delta AV+I]=0$

which reveals the value −1 to be an eigenvalue of the matrix $(\Lambda-\mu I)^{-1}V^{-1}\delta AV$ .
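As a sanity check of this step, the sketch below (assuming NumPy; the matrices `A` and `dA` are arbitrary illustrative choices) builds $(\Lambda-\mu I)^{-1}V^{-1}\delta AV$ for a small example and measures how far −1 is from its spectrum.

```python
import numpy as np

A = np.array([[2.0, 1.0], [0.0, 3.0]])     # diagonalizable, eigenvalues 2 and 3
dA = np.array([[0.0, 0.0], [0.1, 0.0]])    # illustrative perturbation

lam, V = np.linalg.eig(A)                  # A = V diag(lam) V^{-1}
mu = np.linalg.eigvals(A + dA)[0]          # a perturbed eigenvalue, not in sigma(A)

# M = (Lambda - mu I)^{-1} V^{-1} dA V; solve(V, X) computes V^{-1} X.
M = np.diag(1.0 / (lam - mu)) @ np.linalg.solve(V, dA @ V)

# Distance from -1 to the spectrum of M; zero up to rounding error.
gap = np.abs(np.linalg.eigvals(M) + 1.0).min()
print(gap)
```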

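For every consistent matrix norm and every eigenvalue $\nu$ of a square matrix $M$, we have $|\nu|\leq\|M\|$. Applying this to the eigenvalue −1 of $(\Lambda-\mu I)^{-1}V^{-1}\delta AV$, and recalling that all p-norms are consistent, we can write: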

$1\leq\|(\Lambda-\mu I)^{-1}V^{-1}\delta AV\|_p\leq\|(\Lambda-\mu I)^{-1}\|_p\|V^{-1}\|_p\|V\|_p\|\delta A\|_p$

$=\|(\Lambda-\mu I)^{-1}\|_p\ \kappa_p(V)\|\delta A\|_p$

Since $(\Lambda-\mu I)^{-1}$ is a diagonal matrix, its p-norm is easily computed:

$\|(\Lambda-\mu I)^{-1}\|_p=\max_{\|\mathbf{x}\|_p\ne 0} \frac{\|(\Lambda-\mu I)^{-1}\mathbf{x}\|_p}{\|\mathbf{x}\|_p}$

$=\max_{\lambda\in\sigma(A)}\frac{1}{|\lambda -\mu|}\ =\ \frac{1}{\min_{\lambda\in\sigma(A)}|\lambda-\mu|}$

whence:

$\min_{\lambda\in\sigma(A)}|\lambda-\mu|\leq\ \kappa_p(V)\|\delta A\|_p.\,$
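The diagonal-norm identity used in the proof, $\|(\Lambda-\mu I)^{-1}\|_p=1/\min_{\lambda}|\lambda-\mu|$, can be illustrated for $p=1,2,\infty$. This is a small sketch assuming NumPy; the values of `lam` and `mu` are made up for the example.

```python
import numpy as np

lam = np.array([1.0, 2.0 + 1.0j, -3.0])    # illustrative eigenvalues of A
mu = 0.5 + 0.5j                            # illustrative value not in sigma(A)

D_inv = np.diag(1.0 / (lam - mu))          # (Lambda - mu I)^{-1}
expected = 1.0 / np.abs(lam - mu).min()    # 1 / min |lambda - mu|

# Induced matrix p-norms supported by numpy.linalg.norm for 2-D arrays.
norms = {p: np.linalg.norm(D_inv, p) for p in (1, 2, np.inf)}
print(expected, norms)
```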

The theorem can also be reformulated to better suit numerical methods. In practical eigenvalue problems, one often has an exact matrix $A$ but knows only an approximate eigenvalue-eigenvector pair ($\tilde{\lambda}$, $\tilde{\mathbf{v}}$), and needs to bound the error. The following version is useful in that situation.

Theorem (Friedrich L. Bauer, C. T. Fike – 1960) (alternative statement)

Let $A\in\mathbb{C}^{n,n}$ be a diagonalizable matrix, and let $V\in\mathbb{C}^{n,n}$ be a nonsingular eigenvector matrix such that $A=V\Lambda V^{-1}$. Moreover, let ($\tilde{\lambda}$, $\mathbf{\tilde{v}}$) be an approximate eigenvalue-eigenvector pair with $\mathbf{\tilde{v}}\neq\mathbf{0}$, and let $\mathbf{r}=A\mathbf{\tilde{v}}-\tilde{\lambda}\mathbf{\tilde{v}}$ be its residual; then there exists an eigenvalue $\lambda\in\sigma(A)$ such that:

$|\lambda-\tilde{\lambda}|\leq\kappa_p (V)\frac{\|\mathbf{r}\|_p}{\|\mathbf{\tilde{v}}\|_p}$

where $\kappa_p(V)=\|V\|_p\|V^{-1}\|_p$ is the usual condition number in p-norm.
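In practice, this form gives an a posteriori error estimate from the residual alone. The sketch below assumes NumPy; the function name `residual_bound`, the test matrix, and the use of a Rayleigh quotient as $\tilde{\lambda}$ are illustrative assumptions. The true eigenvalues are computed only to compare the actual error with the bound.

```python
import numpy as np

def residual_bound(A, lam_approx, v_approx, p=2):
    """Return (actual distance from lam_approx to sigma(A),
    bound kappa_p(V) * ||r||_p / ||v_approx||_p). Assumes A is diagonalizable."""
    lam, V = np.linalg.eig(A)                         # A = V diag(lam) V^{-1}
    r = A @ v_approx - lam_approx * v_approx          # residual of the approximate pair
    bound = (np.linalg.cond(V, p) * np.linalg.norm(r, p)
             / np.linalg.norm(v_approx, p))
    err = np.abs(lam - lam_approx).min()              # true error, for comparison only
    return err, bound

A = np.array([[4.0, 1.0], [2.0, 3.0]])    # eigenvalues 2 and 5
v = np.array([1.0, 0.9])                  # rough guess at the eigenvector (1, 1)
lam_approx = (v @ A @ v) / (v @ v)        # Rayleigh quotient as approximate eigenvalue
err, bound = residual_bound(A, lam_approx, v)
print(err, bound)
```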

Proof

Suppose $\tilde{\lambda}\notin\sigma(A)$ (otherwise, we can choose $\lambda=\tilde{\lambda}$ and the theorem is proven, since $\kappa_p(V)\geq 1$). Then $(A-\tilde{\lambda} I)^{-1}$ exists, so we can write:

$\mathbf{\tilde{v}}=(A-\tilde{\lambda} I)^{-1}\mathbf{r}=V(\Lambda-\tilde{\lambda} I)^{-1}V^{-1}\mathbf{r}$

since $A$ is diagonalizable; taking the p-norm of both sides, we obtain:

$\|\mathbf{\tilde{v}}\|_p=\|V(\Lambda-\tilde{\lambda} I)^{-1}V^{-1}\mathbf{r}\|_p \leq \|V\|_p \|(\Lambda-\tilde{\lambda} I)^{-1}\|_p \|V^{-1}\|_p \|\mathbf{r}\|_p$

$=\kappa_p(V)\|(\Lambda-\tilde{\lambda} I)^{-1}\|_p \|\mathbf{r}\|_p.$

Since $(\Lambda-\tilde{\lambda} I)^{-1}$ is a diagonal matrix, its p-norm is easily computed:

$\|(\Lambda-\tilde{\lambda} I)^{-1}\|_p=\max_{\|\mathbf{x}\|_p \ne 0}\frac{\|(\Lambda-\tilde{\lambda} I)^{-1}\mathbf{x}\|_p}{\|\mathbf{x}\|_p}$

$=\max_{\lambda\in\sigma(A)} \frac{1}{|\lambda-\tilde{\lambda}|}=\frac{1}{\min_{\lambda\in\sigma(A)}|\lambda-\tilde{\lambda}|}$

whence:

$\min_{\lambda\in\sigma(A)}|\lambda-\tilde{\lambda}|\leq\kappa_p(V)\frac{\|\mathbf{r}\|_p}{\|\mathbf{\tilde{v}}\|_p}.$

The Bauer–Fike theorem, in both versions, yields an absolute bound. The following corollary, which requires the non-singularity of A in addition to all the hypotheses of the Bauer–Fike theorem, is useful whenever a relative bound is needed.

Corollary

Let $A\in\mathbb{C}^{n,n}$ be a nonsingular, diagonalizable matrix, and let $V\in\mathbb{C}^{n,n}$ be a nonsingular eigenvector matrix such that $A=V\Lambda V^{-1}$. Moreover, let $\mu$ be an eigenvalue of the matrix $A+\delta A$; then there exists an eigenvalue $\lambda\in\sigma(A)$ such that:

$\frac{|\lambda-\mu|}{|\lambda|}\leq\kappa_p (V)\|A^{-1}\delta A\|_p$

(Note: $\|A^{-1}\delta A\|$ can be formally viewed as the "relative variation of A", just as $|\lambda-\mu||\lambda|^{-1}$ is the relative variation of λ.)

Proof

Since $\mu$ is an eigenvalue of $A+\delta A$, there is a vector $\mathbf{v}\neq\mathbf{0}$ with $(A+\delta A)\mathbf{v}=\mu\mathbf{v}$. Since $\det(A)\ne 0$, left-multiplying by $-A^{-1}$ gives:

$-A^{-1}(A+\delta A)\mathbf{v}=-\mu A^{-1}\mathbf{v}$

that is, putting $\tilde{A}=\mu A^{-1}$ and $\tilde{\delta A}=-A^{-1}\delta A$:

$(\tilde{A}+\tilde{\delta A}-I)\mathbf{v}=\mathbf{0}$

which means that $\tilde{\mu}=1$ is an eigenvalue of $\tilde{A}+\tilde{\delta A}$, with eigenvector $\mathbf{v}$. Now, the eigenvalues of $\tilde{A}$ are $\frac{\mu}{\lambda_i}$, where the $\lambda_i$ are the eigenvalues of A, while its eigenvector matrix is the same as that of A. Applying the Bauer–Fike theorem to the matrix $\tilde{A}+\tilde{\delta A}$ and to its eigenvalue $\tilde{\mu}=1$, we obtain:

$\min_{\lambda\in\sigma(A)}\left|\frac{\mu}{\lambda}-1\right|=\min_{\lambda\in\sigma(A)}\frac{|\lambda-\mu|}{|\lambda|}\leq\kappa_p (V)\|A^{-1}\delta A\|_p$
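The relative bound of the corollary can be checked in the same way as the absolute one. The following sketch assumes NumPy; the helper name `relative_bauer_fike` and the matrices are illustrative. It uses eigenvalues of very different magnitudes, the situation in which a relative bound is most informative.

```python
import numpy as np

def relative_bauer_fike(A, dA, p=2):
    """Return (worst relative distance from an eigenvalue of A + dA to sigma(A),
    bound kappa_p(V) * ||A^{-1} dA||_p). Assumes A is nonsingular and diagonalizable."""
    lam, V = np.linalg.eig(A)                         # A = V diag(lam) V^{-1}
    mu = np.linalg.eigvals(A + dA)                    # perturbed eigenvalues
    rel_pert = np.linalg.solve(A, dA)                 # A^{-1} dA without forming A^{-1}
    bound = np.linalg.cond(V, p) * np.linalg.norm(rel_pert, p)
    # For each mu_j, min over lambda of |lambda - mu_j| / |lambda|.
    rel = (np.abs(mu[:, None] - lam[None, :]) / np.abs(lam)[None, :]).min(axis=1)
    return rel.max(), bound

A = np.array([[1.0, 2.0], [0.0, 1000.0]])             # eigenvalues 1 and 1000
dA = 1e-3 * np.array([[1.0, -1.0], [2.0, 1.0]])       # illustrative perturbation
rel_worst, rel_bound = relative_bauer_fike(A, dA)
print(rel_worst, rel_bound)
```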

Remark

If A is normal, V can be chosen to be a unitary matrix, so that $\|V\|_2=\|V^{-1}\|_2=1$ and hence $\kappa_2(V)=1$.

The Bauer–Fike theorem then becomes:

$\exists\lambda\in\sigma(A): |\lambda-\mu|\leq\|\delta A\|_2$

( $\exists\lambda\in\sigma(A): |\lambda-\tilde{\lambda}|\leq\frac{\|\mathbf{r}\|_2}{\|\mathbf{\tilde{v}}\|_2}$ in the alternative formulation)

which obviously remains true if A is a Hermitian matrix. In this case, however, a much stronger result holds, known as the Weyl theorem.
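The normal case can be illustrated with a Hermitian matrix, for which `numpy.linalg.eigh` returns an orthonormal eigenvector basis. This is a sketch under those assumptions; the random matrices are illustrative, and the perturbation is deliberately not Hermitian, since the Bauer–Fike bound does not require it to be.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5
B = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A = (B + B.conj().T) / 2                   # Hermitian, hence normal

lam, V = np.linalg.eigh(A)                 # V has orthonormal columns (unitary)
kappa2 = np.linalg.cond(V, 2)              # equals 1 up to rounding error

dA = 1e-3 * (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n)))
mu = np.linalg.eigvals(A + dA)             # perturbed eigenvalues, possibly complex

# Each perturbed eigenvalue lies within ||dA||_2 of some eigenvalue of A.
dist_max = np.abs(mu[:, None] - lam[None, :]).min(axis=1).max()
dA_norm = np.linalg.norm(dA, 2)
print(kappa2, dist_max, dA_norm)
```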

References

F. L. Bauer and C. T. Fike. Norms and exclusion theorems. Numer. Math. 2 (1960), 137–141.
S. C. Eisenstat and I. C. F. Ipsen. Three absolute perturbation bounds for matrix eigenvalues imply relative bounds. SIAM J. Matrix Anal. Appl. 20 (1) (1998), 149–158.
